AITopics | spherical image

spherical image

1e6e0a04d20f50967c64dac2d639a577-AuthorFeedback.pdf

Neural Information Processing Systems, Feb-11-2026, 15:55:54 GMT

developmental algorithm, hardware, node, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.33)

Learning Spherical Convolution for Fast Features from 360° Imagery

Yu-Chuan Su, Kristen Grauman

Neural Information Processing Systems, Nov-21-2025, 04:28:05 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, projection, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.46)

1e6e0a04d20f50967c64dac2d639a577-AuthorFeedback.pdf

Neural Information Processing Systems, Oct-2-2025, 08:07:39 GMT

We have also shown how the developmental algorithm scales with network size in the supplementary material (S5).

artificial intelligence, developmental algorithm, machine learning, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.33)

Learning Spherical Convolution for Fast Features from 360° Imagery

Yu-Chuan Su, Kristen Grauman

Neural Information Processing Systems, Oct-2-2024, 17:37:31 GMT

While 360° cameras offer tremendous new possibilities in vision, graphics, and augmented reality, the spherical images they produce make core feature extraction non-trivial. Convolutional neural networks (CNNs) trained on images from perspective cameras yield "flat" filters, yet 360° images cannot be projected to a single plane without significant distortion. A naive solution that repeatedly projects the viewing sphere to all tangent planes is accurate, but much too computationally intensive for real problems. We propose to learn a spherical convolutional network that translates a planar CNN to process 360° imagery directly in its equirectangular projection. Our approach learns to reproduce the flat filter outputs on 360° data, sensitive to the varying distortion effects across the viewing sphere. The key benefits are 1) efficient feature extraction for 360° images and video, and 2) the ability to leverage powerful pre-trained networks researchers have carefully honed (together with massive labeled image training sets) for perspective images. We validate our approach compared to several alternative methods in terms of both raw CNN output accuracy as well as applying a state-of-the-art "flat" object detector to 360° data. Our method yields the most accurate results while saving orders of magnitude in computation versus the existing exact reprojection solution.
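
As a rough illustration of the "varying distortion effects across the viewing sphere" mentioned in the abstract, the sketch below maps each pixel row of an equirectangular image to its latitude and computes the horizontal stretch factor 1/cos(latitude). This is only the underlying geometry, not the authors' network. The function names, the image height, and the kernel-widening rule are hypothetical and chosen purely for illustration.

```python
import math

def row_to_latitude(row, height):
    """Latitude in radians at the centre of a pixel row; row 0 is the top (north pole side)."""
    return math.pi / 2 - (row + 0.5) * math.pi / height

def horizontal_stretch(row, height):
    """How much a row is stretched horizontally relative to the equator."""
    return 1.0 / math.cos(row_to_latitude(row, height))

def kernel_width_for_row(base_width, row, height, max_width):
    """Hypothetical rule (an assumption): widen a flat kernel in proportion
    to the stretch of the row, capped at max_width."""
    width = round(base_width * horizontal_stretch(row, height))
    return min(max_width, width)

if __name__ == "__main__":
    height = 320  # assumed equirectangular image height in pixels
    for row in (0, 80, 159):
        print(row, horizontal_stretch(row, height),
              kernel_width_for_row(3, row, height, 64))
```

Rows near the equator keep a stretch close to 1, while rows near the poles are stretched by two orders of magnitude at this resolution, which is why a single flat filter cannot serve every row equally well.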

convolution, distortion, equirectangular projection, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.66)

Geometry Fidelity for Spherical Images

Christensen, Anders; Mojab, Nooshin; Patel, Khushman; Ahuja, Karan; Akata, Zeynep; Winther, Ole; Gonzalez-Franco, Mar; Colaco, Andrea

arXiv.org Artificial Intelligence, Jul-25-2024

Spherical or omni-directional images offer an immersive visual format appealing to a wide range of computer vision applications. However, geometric properties of spherical images pose a major challenge for models and metrics designed for ordinary 2D images. Here, we show that direct application of Fréchet Inception Distance (FID) is insufficient for quantifying geometric fidelity in spherical images. We introduce two quantitative metrics accounting for geometric constraints, namely Omnidirectional FID (OmniFID) and Discontinuity Score (DS). OmniFID is an extension of FID tailored to additionally capture field-of-view requirements of the spherical format by leveraging cubemap projections. DS is a kernel-based seam alignment score of continuity across borders of 2D representations of spherical images. In experiments, OmniFID and DS quantify geometry fidelity issues that are undetected by FID.
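
To make the idea of a seam check concrete, here is a deliberately simplified stand-in, not the kernel-based Discontinuity Score defined in the paper. It assumes a grayscale equirectangular image stored as a list of rows, and compares the jump between the first and last columns (which are neighbours on the sphere) with the typical jump between adjacent interior pixels. All function names are hypothetical.

```python
def seam_gap(image):
    """Mean absolute difference between the first and last column of each row."""
    return sum(abs(row[0] - row[-1]) for row in image) / len(image)

def interior_gap(image):
    """Mean absolute difference between horizontally adjacent pixels."""
    diffs = [abs(row[c] - row[c + 1])
             for row in image
             for c in range(len(row) - 1)]
    return sum(diffs) / len(diffs)

def seam_ratio(image, eps=1e-8):
    """Ratio well above 1 suggests the wrap-around seam is rougher than the interior.
    This is an illustrative assumption, not the metric from the paper."""
    return seam_gap(image) / (interior_gap(image) + eps)

if __name__ == "__main__":
    ramp = [[0, 1, 2, 3], [0, 1, 2, 3]]   # brightness ramp: does not wrap cleanly
    wrap = [[0, 1, 2, 1], [0, 1, 2, 1]]   # ends differ by one step: wraps cleanly
    print(seam_ratio(ramp), seam_ratio(wrap))
```

A plain pixel-difference ratio like this ignores texture and edge orientation, so it should be read only as an intuition for what "continuity across borders" measures.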

projection, representation, spherical image, (15 more...)

arXiv.org Artificial Intelligence

2407.18207

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.95)
(3 more...)

3D Scene Geometry Estimation from 360° Imagery: A Survey

da Silveira, Thiago Lopes Trugillo; Pinto, Paulo Gamarra Lessa; Llerena, Jeffri Erwin Murrugarra; Jung, Claudio Rosito

arXiv.org Artificial Intelligence, Jan-17-2024

The world is three-dimensional (3D). As such, recovering 3D information about real-world objects enables many relevant applications, including self-driving cars [1, 2], robot navigation [3, 4], virtual tourism [5, 6], infrastructure inspection [7, 8], archaeological [9, 10] and architectural modeling [5, 11], city planning [12, 13], and 3D cinema [14, 15]. Many sensors can be used to obtain 3D data from real objects, such as light detection and ranging [16], structured light [17], and time of flight [18]. There is a plethora of approaches for inferring 3D information from plain color images/videos. The widespread accessibility and low cost of consumer cameras are a strong motivation for the continued research efforts devoted to image-based 3D scene reconstruction methods [19]. In theory, 3D information can only be inferred from two or more captures of the scene, as in typical multi-view stereo [20] or structure from motion [21] approaches. However, recent approaches are exploring machine learning to perform single-image depth inference [22, 23, 24]. Most methods developed so far rely on traditional perspective/pinhole-based cameras, which have a narrow field of view (FoV) and thus might require thousands of captures to model large scenes [25, 26].

computer vision, estimation, international conference, (12 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3519021

2401.09252

Country:

North America > Canada > Newfoundland and Labrador > Labrador (0.04)
South America > Brazil > Rio Grande do Sul (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > Switzerland > Geneva > Geneva (0.04)

Genre:

Overview (1.00)
Summary/Review (0.92)
Research Report > New Finding (0.67)

Industry:

Media > Photography (1.00)
Transportation (0.68)
Media > Film (0.67)
(3 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision > Image Understanding (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds

Zhao, Zhipeng; Yu, Huai; Lyv, Chenwei; Yang, Wen; Scherer, Sebastian

arXiv.org Artificial Intelligence, Dec-6-2022

Visual localization plays an important role for intelligent robots and autonomous driving, especially when GNSS is unreliable. Recently, camera localization in LiDAR maps has attracted growing attention for its low cost and potential robustness to illumination and weather changes. However, the commonly used pinhole camera has a narrow field of view, which limits the information it captures compared with omni-directional LiDAR data. To overcome this limitation, we focus on correlating the information of 360 equirectangular images to point clouds, proposing an end-to-end learnable network to conduct cross-modal visual localization by establishing similarity in high-dimensional feature space. Inspired by the attention mechanism, we optimize the network to capture salient features for comparing images and point clouds. We construct several sequences containing 360 equirectangular images and corresponding point clouds based on the KITTI-360 dataset and conduct extensive experiments. The results demonstrate the effectiveness of our approach.

artificial intelligence, machine learning, point cloud, (18 more...)

arXiv.org Artificial Intelligence

2212.02757

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
Asia > China > Hubei Province > Wuhan (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology > Services (0.35)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Learning Spherical Convolution for Fast Features from 360° Imagery

Su, Yu-Chuan; Grauman, Kristen

Neural Information Processing Systems, Dec-31-2017

While 360° cameras offer tremendous new possibilities in vision, graphics, and augmented reality, the spherical images they produce make core feature extraction non-trivial. Convolutional neural networks (CNNs) trained on images from perspective cameras yield "flat" filters, yet 360° images cannot be projected to a single plane without significant distortion. A naive solution that repeatedly projects the viewing sphere to all tangent planes is accurate, but much too computationally intensive for real problems. We propose to learn a spherical convolutional network that translates a planar CNN to process 360° imagery directly in its equirectangular projection. Our approach learns to reproduce the flat filter outputs on 360° data, sensitive to the varying distortion effects across the viewing sphere. The key benefits are 1) efficient feature extraction for 360° images and video, and 2) the ability to leverage powerful pre-trained networks researchers have carefully honed (together with massive labeled image training sets) for perspective images. We validate our approach compared to several alternative methods in terms of both raw CNN output accuracy as well as applying a state-of-the-art "flat" object detector to 360° data. Our method yields the most accurate results while saving orders of magnitude in computation versus the existing exact reprojection solution.

artificial intelligence, human computer interaction, machine learning, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.66)
